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Abstract — We consider information dissemination in a large 
n-user wireless network in which k users wish to share a 
unique message with all other users. Each of the n users only 
has knowledge of its own contents and state information; this 
corresponds to a one-sided push-only scenario. The goal is to 
disseminate all messages efficiently, hopefully achieving an order- 
optimal spreading rate over unicast wireless random networks. 
First, we show that a random-push strategy - where a user 
sends its own or a received packet at random - is order-wise 
suboptimal in a random geometric graph: specifically, Q. {yfri) 
times slower than optimal spreading. It is known that this 
gap can be closed if each user has "full" mobility, since this 
effectively creates a complete graph. We instead consider velocity- 
constrained mobility where at each time slot the user moves 
locally using a discrete random walk with velocity v(n) that 
is much lower than full mobility. We propose a simple two- 
stage dissemination strategy that alternates between individual 
message flooding ("self promotion") and random gossiping. We 
prove that this scheme achieves a close to optimal spreading 
rate (within only a logarithmic gap) as long as the velocity 
is at least v(n) — cj(^/logn/k). The key insight is that the 
mixing property introduced by the partial mobility helps users 
to spread in space within a relatively short period compared 
to the optimal spreading time, which macroscopically mimics 
message dissemination over a complete graph. 

Index Terms — Gossip algorithms, information dissemination, 
mobility, wireless random networks 



I. Introduction 

In wireless ad hoc or social networks, a variety of sce- 
narios require agents to share their individual information or 
resources with each other for mutual benefits. A partial list 
includes file sharing and rumor spreading |2)-|5), distributed 
computation and parameter estimation |[6[-|[T0[, and schedul- 
ing and control fTT) , | fl"2) . Due to the huge centralization 
overhead and unpredictable dynamics in large networks, it 
is usually more practical to disseminate information and ex- 
change messages in a decentralized and asynchronous manner 
to combat unpredictable topology changes and the lack of 
global state information. This motivates the exploration of 
dissemination strategies that are inherently simple, distributed 
and asynchronous while achieving optimal spreading rates. 
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A. Motivation and Related Work 

Among distributed asynchronous algorithms, gossip algo- 
rithms are a class of protocols which propagate messages 
according to rumor-style rules, initially proposed in |13|. 
Specifically, suppose that there are k < n distinct pieces 
of messages that need to be flooded to all n users in the 
network. Each agent in each round attempts to communicate 
with one of its neighbors in a random fashion to disseminate 
a limited number of messages. There are two types of push- 
based strategies on selecting which message to be sent in 
each round: (a) one-sided protocols that are based only on the 
disseminator's own current state; and (b) two-sided protocols 
based on current states of both the sender and the receiver. 
Encouragingly, a simple uncoded one-sided push-only gossip 
algorithm with random message selection and peer selection 
is sufficient for efficient dissemination in some cases like a 
static complete graph, which achieves a spreading time of 
(k log n) Q within only a logarithmic gap with respect to 
the optimal lower limit 0(fc) ]14|-p6). This type of one-sided 
gossiping has the advantages of being easily implementable 
and inherently distributed. 

Since each user can receive at most one message in any 
single slot, it is desirable for a protocol to achieve close 
to the fastest possible spreading time (k) (e.g. within a 
polylog(rt) factor). It has been pointed out, however, that 
the spreading rate of one-sided random gossip algorithms 
is frequently constrained by the network geometry, e.g. the 
conductance of the graph fl3) , (T7J. For instance, for one-sided 
rumor-style all-to-all spreading (i.e. k = n), the completion 
time T is much lower in a complete graph (T — O (nlogn)) 
than in a ring (T = fi(n 2 )). Intuitively, since each user can 
only communicate with its nearest neighbors, the geometric 
constraints in these graphs limit the location distribution of all 
copies of each message during the evolution process, which 
largely limits how fast the information can flow across the 
network. In fact, for message spreading over static wireless 
networks, one-sided uncoded push-based random gossiping 

can be quite inefficient: specifically up to £1 



poly(logn) 

times slower than the optimal lower limit (k) (i.e. a poly 
nomial factor away from the lower bound), as will be shown 
in Theorem [2] 

Although one-sided random gossiping is not efficient for 
static wireless networks, it may potentially achieve better 
performance if each user has some degree of mobility - an 
intrinsic feature of many wireless and social networks. For 

'The standard notion f(n) = ui (g(n)) means lim g(n)/f(n) = 0; 

n— >oo 

f(n) = o(g(n)) means lim f(n)/g(n) = 0; f[n) = Q(g(n)) means 3 

n — yoo 

a constant c such that f(n) > cg(n); f(n) = O {g(n)) means 3 a constant 
c such that /(n) < cg(n); / (n) = (g{n)) means 3 constants ci and C2 
such that cig(n) < f(n) < C2</(n). 
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instance, full mobilitjr] changes the geometric graph with 

transmission range O \ to a complete graph in the 

unicast scenario. Since random gossiping achieves a spreading 
time of 0(nlog(n)) for all-to-all spreading over a complete 
graph (15} , (16) , this allows near-optimal spreading time to 
be achieved within a logarithmic factor from the fundamental 
lower limit (n). However, how much benefit can be obtained 
from more realistic mobility - which may be significantly 
lower than idealized best-case full mobility - is not clear. 
Most existing results on uncoded random gossiping center on 
evolutions associated with static homogeneous graph structure 
or a fixed adjacency matrix, which cannot be readily extended 
for dynamic topology changes. To the best of our knowledge, 
the first work to analyze gossiping with mobility was |18|, 
which focused on energy-efficient distributed averaging instead 
of time-efficient message propagation. Another line of work 
by Clementi et. al. investigate the speed limit for information 
flooding over Markovian evolving graphs (e.g. [ 19]— [21]), but 
they did not study the spreading rate under multi-message 
gossip. Recently, Pettarin et. al. (22) explored the information 
spreading over sparse mobile networks with no connected 
components of size Q (logn), which does not account for the 
dense (interference-limited) network model we consider in this 
paper. 

For a broad class of graphs that include both static and 
dynamic graphs, the lower limit on the spreading time can 
be achieved through random linear coding where a random 
combination of all messages are transmitted instead of a 
specific message (23}, or by employing a two-sided protocol 
which always disseminates an innovative message if possible 
|16|. Specifically, through a unified framework based on dual- 
space analysis, recent work [24 1 demonstrated that the optimal 
all-to-all spreading time 0(n) can be achieved for a large 
class of graphs (24) including complete graphs, geometric 
graphs, and the results hold in these network models even 
when the topology is allowed to change dynamically at each 
time. However, performing random network coding incurs very 
large computation overhead for each user, and is not always 
feasible in practice. On the other hand, two-sided protocols 
inherently require additional feedback that increases commu- 
nication overhead. Also, the state information of the target may 
sometimes be unobtainable due to privacy or security concerns. 
Furthermore, if there are k <C y/n messages that need to 
be disseminated over a static uncoordinated unicast wireless 
network or a random geometric graph with transmission radius 

( y^poly (log n)J , neither network coding nor two-sided 
protocols can approach the lower limit of spreading time 
0(fc). This arises due to the fact that the diameter of the 

underlying graph with transmission range ' - / P ol y( lo g ") 



to be relayed through Q, 



hops in order to reach 



scales as 



poly (logn) 



and hence each message may need 



2 By full mobility, we mean that the location of the mobile is distributed 
independently and uniformly random over the entire network over consecutive 
time-steps (i.e., the velocity of the mobile can be "arbitrarily large"). This is 
sometimes also referred to in literature as the i.i.d. mobility model. In this 
paper, we consider nodes with "velocity-limited" mobility capability. 



polyflogn 

the node farthest to the source. 

Another line of work has studied spreading scaling laws 
using more sophisticated non-gossip schemes over static wire- 
less networks, e.g. (251, (26). Recently, Resta et. al. |27| 



began investigating broadcast schemes for mobile networks 
with a single static source constantly propagating new data, 
while we focus on a different problem with multiple mobile 



sources each sharing distinct message. Besides, [27] analyzed 
how to combat the adverse effect of mobility to ensure the 
same pipelined broadcasting as in static networks, whereas 
we are interested in how to take advantage of mobility to 
overcome the geometric constraints. In fact, with the help 
of mobility, simply performing random gossiping - which is 
simpler than most non-gossip schemes and does not require 
additional overhead - is sufficient to achieve optimality. 

Finally, we note that gossip algorithms have also been 
employed and analyzed for other scenarios like distributed av- 
eraging, where each node is willing to compute the average of 
all initial values given at all nodes in a decentralized manner, 
e.g. (6), (8). The objective of such distributed consensus is to 
minimize the total number of computations. It turns out that 
the convergence rates of both message sharing and distributed 
averaging are largely dependent on the eigenvalues or, more 
specifically, the mixing times of the graph matrices associated 
with the network geometry (8), (15). 



B. Problem Definition and Main Modeling Assumptions 

Suppose there are n users randomly located over a square of 
unit area. The task is to disseminate k < n distinct messages 
(each contained in one user initially) among all users. The 
message spreading can be categorized into two types: (a) 
single-message dissemination: a single user (or 0(1) users) 
wishes to flood its message to all other users; (b) multi- 
message dissemination: a large number k(k 1) of users 
wish to spread individual messages to all other users. We note 
that distinct messages may not be injected into the network 
simultaneously. They may arrive in the network (possibly 
in batches) sequentially, but the arrival time information is 
unknown in the network. 

Our objective is to design a gossip-style one-sided algorithm 
in the absence of coding, such that it can take advantage of the 
intrinsic feature of mobility to accelerate dissemination. Only 
the "push" operation is considered in this paper, i.e. a sender 
determines which message to transmit solely based on its own 
current state, and in particular not using the intended receiver's 
state. We are interested in identifying the range of the degree 
of mobility within which our algorithm achieves near-optimal 
spreading time O (k poly (logn)) for each message regardless 
of message arrival patterns. Specifically, our MOBILE PUSH 
protocol achieves a spreading time O {k log 2 n) as stated in 
Theorem [3] for the mobility that is significantly lower than the 
idealized full mobility. As an aside, it has been shown in [ 16 1, 
(23} that with high probability, the completion time for one- 
sided uncoded random gossip protocol over complete graphs 
is lower bounded by ft (k log n), which implies that in general 
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Table I 
Summary of Notation 

velocity 

the number of subsquares; rn = l/v 2 (n) 

the number of users/nodes 

the number of distinct messages 

the message of source i 

subsquare k 

the number, and the set of nodes containing 
Mi at time t 

the number, and the set of nodes containing 

Mi at subsquare at time t 

the number, and the set of messages node i 

has at time t 

path loss exponent 

SINR requirement for single-hop success 



the logarithmic gap from the universal lower limit 8 (k) 
cannot be closed with uncoded one-sided random gossiping. 

Our basic network model is as follows. Initially, there are 
n users uniformly distributed over a unit square. We ignore 
edge effects so that every node can be viewed as homogeneous. 
Our models and analysis are mainly based on the context of 
wireless ad hoc networks, but one can easily apply them to 
other network scenarios that can be modeled as a random 
geometric graph of transmission radius (^^log n/nj . 

Physical-Layer Transmission Model. Each transmitter em- 
ploys the same amount of power P, and the noise power 
density is assumed to be r\. The path-loss model is used such 
that node j receives the signal from transmitter i with power 
Pr~", where denotes the Euclidean distance between i 
and j with a > 2 being the path loss exponent. Denote by 
T(i) the set of transmitters at time t. We assume that a packet 
from transmitter i is successfully received by node j at time 
t if 



SINRij(f) 



Pr~ a 



v + E Pr kj 



> P, 



(1) 



where SINRyft) is the signal-to-interference-plus-noise ratio 
(SINR) at j at time t, and (3 the SINR threshold required 
for successful reception. For simplicity, we suppose only one 
fixed-size message or packet can be transmitted for each 
transmission pair in each time instance. 

Suppose that each node can move with velocity v(n) in 
this mobile network. We provide a precise description of the 
mobility pattern as follows. 

Mobility Model. We use a mobility pattern similar to |28 
Section VIII], which ensures that at steady state, each user 
lies in each subsquare with equal probability. Specifically, we 
divide the entire square into m := l/v 2 (n) subsquares each 
of area v 2 (n) (where v(n) denotes the velocity of the mobile 
nodes), which forms a y/mx y/m discrete torus. At each time 
instance, every node moves according to a random walk on the 
Jra x •Jrh discrete torus. More precisely, if a node resides in 
a subsquare € {1, • • • , \Jrn\ at time t, it may choose to 
stay in or move to any of the eight adjacent subsquares 
each with probability 1/9 at time t + 1. If a node is on the 
edge and is selected to move in an infeasible direction, then 
it stays in its current subsquare. The position inside the new 



subsquare is selected uniformly at random. See Fig. [T] for an 
illustration. 

We note that when v{n) = 1/3 = 0(1), the pattern reverts 
to the full mobility model. In this random-walk model, each 
node moves independently according to a uniform ergodic 
distribution. In fact, a variety of mobility patterns have been 
proposed to model mobile networks, including i.i.d. (full) 
mobility |29|, random walk (discrete-time) model [30|, |31|, 
and Brownian motion (continuous-time) pattern |32|. For 
simplicity, we model it as a discrete-time random walk pattern, 
since it already captures intrinsic features of mobile networks 
like uncontrolled placement and movement of nodes. 

subsquare of size v 2 (n) 
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4m = l/v(n) edges 

Figure 1. The unit square is equally divided into m = l/v 2 (n) subsquares. 
Each node can jump to one of its 8 neighboring subsquares or stay in its 
current subsquare with equal probability 1 /9 at the beginning of each slot. 



C. Contributions and Organization 

The main contributions of this paper include the following. 

1) Single-message dissemination over mobile networks. 

We derive an upper bound on the single-message (k = 
(1)) spreading time using push-only random gossiping 
(called RANDOM PUSH) in mobile networks. A gain 
of fi (v{n)\/rL/ log 2 n)) in the spreading rate can be ob- 
tained compared with static networks, which is, however, 
still limited by the underlying geometry unless there is 
full mobility. 

2) Multi-message dissemination over static networks. 

We develop a lower bound on the multi-message spread- 
ing time under RANDOM PUSH protocol over static 
networks. It turns out that there may exist a gap as large 



as Q, 



( \ 

^poly(logn) J 



between its spreading time and the 



,poly(logn) / 

optimal lower limit (fc). The key intuition is that the 
copies of each message Mi tend to cluster around the 
source i at all time instances, which results in capacity 
loss. This inherently constrains how fast the information 
can flow across the network. 
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3) Multi-message dissemination over mobile networks. 

We design a one-sided uncoded message-selection strat- 
egy called MOBILE PUSH that accelerates multi- 
message spreading (k — w(logn)) with mobility. An 
upper bound on the spreading time is derived, which is 

the main result of this paper. Once v(n) = u> ftj ^p 1 

(which is still significantly smaller than full mobility), 
the near-optimal spreading time O {k log 2 n) can be 
achieved with high probability. The underlying intuition 
is that if the mixing time arising from the mobility model 
is smaller than the optimal spreading time, the mixing 
property approximately uniformizes the location of all 
copies of each message, which allows the evolution to 
mimic the propagation over a complete graph. 
The remainder of this paper is organized as follows. In Section 
[IT] we describe our unicast physical-layer transmission strat- 
egy and two types of message selection strategies, including 
RANDOM PUSH and MOBILE PUSH. Our main theorems 
are stated in Section [IT] as well, with proof ideas illustrated in 
Section [3] Detailed derivation of auxiliary lemmas are deferred 
to the Appendix. 

II. Strategies and Main Results 

The strategies and main results of this work are outlined in 
this section, where only the unicast scenario is considered. The 
dissemination protocols for wireless networks are a class of 
scheduling algorithms that can be decoupled into (a) physical- 
layer transmission strategies (link scheduling) and (b) message 
selection strategies (message scheduling). 

One physical-layer transmission strategy and two message 
selection strategies are described separately, along with the 
order-wise performance bounds. 

A. Strategies 

1) Physical-Layer Transmission Strategy: In order to 
achieve efficient spreading, it is natural to resort to a decentral- 
ized transmission strategy that supports the order-wise largest 
number (i.e. 0(n)) of concurrent successful transmissions 
per time instance. The following strategy is a candidate that 
achieves this objective with local communication. 



we can see (which we omit here) that there exists a constant 
c such that 



UNICAST Physical-Layer Transmission Strategy: 

• At each time slot, each node i is designated as a 
sender independently with constant probability 8, 
and a potential receiver otherwise. Here, 8 < 0.5 
is independent of n and k. 

• Every sender i attempts to transmit one message to 
its nearest potential receiver 



This simple "link" scheduling strategy, when combined with 
appropriate push-based message selection strategies, leads to 
the near-optimal performance in this paper. We note that 
the authors in (29), by adopting a slightly different strategy 
in which 8n nodes are randomly designated as senders (as 
opposed to link-by-link random selection as in our paper), 
have shown that the success probability for each unicast pair 



(SINR ljW (t) >$) >c 



(2) 



holds for our strategy. Here, c is a constant irrespective of n, 
but may depend on other salient parameters P, a and r\. That 
said, (n) concurrent transmissions can be successful, which 
is order-optimal. For ease of analysis and exposition, we fur- 
ther assume that physical-layer success events are temporally 
independent for simplicity of analysis and exposition. Indeed, 
even accounting for the correlation yields the same scaling 
results, detailed in Remark 1. 

Remark 1. In fact, the physical-layer success events are 
correlated across different time slots due to our mobility 
model and transmission strategy. However, we observe that 
our analysis framework would only require that the transmis- 
sion success probability at time t + 1 is always a constant 
irrespective of n given the node locations at time t. To 
address this concern, we show in Lemma [T] that for any 
m < n/ (32 log n), the number of nodes Na ( residing in each 
subsquare A, is bounded within ^] with probability at 
least 1 — 2?i~ 3 . Conditional on this high-probability event that 
€ [~6m ' 3m] w ^ tn a ^ noc l es m eacn subsquare uniformly 
located, we can use the same proof as (29] Theorem III-5] to 
show that P (SINR, i J (, i )(t) > (3) > c holds for some constant 
c. 

Although this physical-layer transmission strategy supports 
(n) concurrent local transmissions, it does not tell us 
how to take advantage of these resources to allow efficient 
propagation. This will be specified by the message-selection 
strategy, which potentially determines how each message is 
propagated and forwarded over the entire network. 

2) Message Selection Strategy: We now turn to the ob- 
jective of designing a one-sided message-selection strategy 
(only based on the transmitter's current state) that is efficient 
in the absence of network coding. We are interested in a 
decentralized strategy in which no user has prior information 
on the number of distinct messages existing in the network. 
One common strategy within this class is: 



RANDOM PUSH Message Selection Strategy: 

• In every time slot: each sender i randomly selects 
one of the messages it possesses for transmission. 



is a constant. Using the same proof as for |29 Theorem III-5], 



This is a simple gossip algorithm solely based on random 
message selection, which is surprisingly efficient in many 
cases like a complete graph. It will be shown later, however, 
that this simple strategy is inefficient in a static unicast wire- 
less network or a random geometric graph with transmission 

range 

In order to take advantage of the mobility, we propose the 
following alternating strategy within this class: 
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MOBILE PUSH Message Selection Strategy: 

• Denote by Mi the message that source i wants to 
spread, i.e. its own message. 

• In every odd time slot: for each sender i, if it has 
an individual message M i7 then i selects Mj for 
transmission; otherwise i randomly selects one of 
the messages it possesses for transmission. 

• In every even time slot: each sender i randomly 
selects one of the messages it has received for 
transmission. 

In the above strategy, each sender alternates between ran- 
dom gossiping and self promotion. This alternating operation 
is crucial if we do not know a priori the number of distinct 
messages. Basically, random gossiping enables rapid spreading 
by taking advantage of all available throughput, and provides 
a non-degenerate approach that ensures an approximately 
"uniform" evolution for all distinct messages. On the other 
hand, individual message flooding step plays the role of self- 
advocating, which guarantees that a sufficiently large number 
of copies of each message can be forwarded with the assistance 
of mobility (which is not true in static networks). This is 
critical at the initial stage of the evolution. 

B. Main Results (without proof) 

Now we proceed to state our main theorems, each of 
which characterizes the performance for one distinct scenario. 



Detailed analysis is deferred to Section III 



1 ) Single-Message Dissemination in Mobile Networks with 
RANDOM PUSH: The first theorem states the limited benefits 
of mobility on the spreading rate for single-message spreading 
when RANDOM PUSH is employed. We note that MOBILE 
PUSH reverts to RANDOM PUSH for single-message dissem- 
ination, and hence has the same spreading time. 



Theorem 1. Assume that the velocity obeys v(n) > y 321 ° s " , 
and that the number of distinct messages obeys k = 6(1). 
RANDOM PUSH message selection strategy is assumed to be 
employed in the unicast scenario. Denote by T S p C (i) the time 
taken for all users to receive message Mi after Mi is injected 
into the network, then with probability at least 1 — n~ 2 we 



have 



Vi, T™(i) = 



log n 
v(n) 



and T s uc (i) = n 



v(n) 



(3) 

Since the single-message flooding time is H(^/n/ log n) un- 
der RANDOM PUSH over static wireless networks or random 
geometric graphs of radius (^J\ogn/n s j J 171, the gain in 

dissemination rate due to mobility is Q (v(njy/n/ log 2 n)). 
When the mobility is large enough (e.g. v(n) = uj y\J 

it plays the role of increasing the transmission radius, thus 
resulting in the speedup. It can be easily verified, however, that 
the universal lower bound on the spreading time is (log n), 
which can only be achieved in the presence of full mobility. 
To summarize, while the speedup il (v(n)y/n/ log 2 n)) can 

be achieved in the regime J 32 log " < v(n) < 0(1), 



RANDOM PUSH cannot achieve near-optimal spreading time 
O (poly (log n)) for single-message dissemination unless full 
mobility is present. 

2) Multi-Message Dissemination in Static Networks with 
RANDOM PUSH: Now we turn to multi-message spreading 
over static networks with uncoded random gossiping. Our 
analysis is developed for the regime where there are fc distinct 
messages that satisfies k = uj (polylog (n)), which subsumes 
most multi-spreading cases of interest. For its complement 
regime where k = O (polylog (n)), an apparent lower bound 
il (yn/polylog(?i)) on the spreading time can be obtained by 
observing that the diameter of the underlying graph with trans- 
mission radius is at least ft (y / n/polylog(n)). 

This immediately indicates a gap f2 (\/n/polylog(ri)) between 
the spreading time and the lower limit k — O (polylog (n)). 

The spreading time in the regime k — uj (poly (log n)) 
is formally stated in Theorem [2] which implies that simple 
RANDOM GOSSIP is inefficient in static wireless networks, 
under a message injection scenario where users start message 
dissemination sequentially. The setting is as follows: (k — 1) 
of the sources inject their messages into the network at some 
time prior to the fc-th source. At a future time when each user 
in the network has at least w = oj (poly (log n)) messages, the 
fc-th message (denoted by M*) is injected into the network. 
This pattern occurs, for example, when a new message is 
injected into the network much later than other messages, and 
hence all other messages have been spread to a large number 
of users. We will show that without mobility, the spreading 
time under MOBILE PUSH in these scenarios is at least of 
the same order as that under RANDOM PUSH which is a 
polynomial factor away from the universal lower limit 0(fc). 
In fact, the individual message flooding operation of MOBILE 
PUSH does not accelerate spreading since each source has 
only 0(poly (logn)) potential neighbors to communicate. 

The main objective of analyzing the above scenario is to 
uncover the fact that uncoded one-sided random gossiping 
fails to achieve near-optimal spreading for a large number of 
message injection scenarios over static networks. This is in 
contrast to mobile networks, where protocols like MOBILE 
PUSH with the assistance of mobility is robust to all ini- 
tial message injection patterns and can always achieve near- 
optimal spreading, as will be shown later. 

Theorem 2. Assume that a new message M* arrives in a 
static network later than other k — 1 messages, and suppose 
that M* is first injected into the network from a state such that 
each node has received at least w = uj (poly (logn)) distinct 
messages. Denote by T* the time until every user receives M* 
using RANDOM PUSH, then for any constant e > we have 



T * >W VlSSlo^ (4) 
with probability exceeding 1 — n~ 2 . 

3 We note that this section is devoted to showing the spreading inefficiency 
under two uncoded one-sided push-only protocols. It has recently been shown 
in )24[ that a network coding approach can allow the optimal spreading 
time 0(fc) to be achieved over static wireless networks or random geometric 
graphs. 
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Remark 2. Our main goal is to characterize the spreading 
inefficiency when each node has received a few messages, 
which becomes most significant when each has received 
9 (fc) messages. In contrast, when only a constant number 
of messages are available at each user, the evolution can 
be fairly fast since the piece selection has not yet be- 
come a bottleneck. Hence, we consider w — w(polylog(n)), 
which captures most of the spreading-inefficient regime 
(oj (poly log (n.)) < w < 9(fc)). The spreading can be quite 
slow for various message-injection process over static net- 
works, but can always be completed within O {k log 2 n) with 

the assistance of mobility v(n) = u> (^^logn/kj regardless 
of the message-injection process, as will be shown in Theorem 
® 

Theorem [2] implies that if M* is injected into the network 

when each user contains uj ( y^ P oiy(iogn) ) messa g es f° r an Y 
e > 0, then RANDOM PUSH is unable to approach the fastest 
possible spreading time 9 (k). In particular, if the message is 
first transmitted when each user contains fl (fc/poly(logn)) 
messages, then at least il (fc 1_e y / n /poly (log n)) time slots 
are required to complete spreading. Since e can be arbitrarily 
small, there may exist a gap as large as fl ( poiy ^ g „) ) between 
the lower limit 9(fc) and the spreading time using RANDOM 
PUSH. The reason is that as each user receives many distinct 
messages, a bottleneck of spreading rate arises due to the 
low piece-selection probability assigned for each message. A 
number of transmissions are wasted due to the blindness of 
the one-sided message selection, which results in capacity loss 
and hence largely constrain how efficient information can flow 
across the network. The copies of each message tend to cluster 
around the source - the density of the copies decays rapidly 
with the distance to the source. Such inefficiency becomes 
more severe as the evolution proceeds, because each user will 
assign an increasingly smaller piece-selection probability for 
each message. 

3) Multi-Message Dissemination in Mobile Networks with 
MOBILE PUSH: Although the near-optimal spreading time 
O (polylog(rt)) for single message dissemination can only 
be achieved when there is near-full mobility v(n) = 

(l/polylog(n)), a limited degree of velocity turns out to 
be remarkably helpful in the multi-message case as stated in 
the following theorem. 

Theorem 3. Assume that the velocity obeys: v(n) = 



log ti 



where the number of distinct messages obeys 



k = uj [poly (log n)). MOBILE PUSH message selection strat- 
egy is employed along with unicast transmission strategy. Let 
T^ p (i) be the time taken for all users to receive message Mi 
after Mi is first injected into the network, then with probability 
at least 1 — n~ 2 we have 



v», ir P M = o(fcio g 2 



(5) 



Since each node can receive at most one message in each 
time slot, the spreading time is lower bounded by 9 (k) for 
any graph. Thus, our strategy with limited velocity spreads 
the information essentially as fast as possible. Intuitively, 



this is due to the fact that the velocity (even with restricted 
magnitude) helps uniformize the locations of all copies of 
each message, which significantly increases the conductance 
of the underlying graph in each slot. Although the velocity is 
significantly smaller than full mobility (which simply results 
in a complete graph), the relatively low mixing time helps to 
approximately achieve the same objective of uniformization. 
On the other hand, the low spreading rates in static networks 
arise from the fact that the copies of each message tend to 
cluster around the source at any time instant, which decreases 
the number of flows going towards new users without this 
message. 

Remark 3. Note that there is a O (log 2 n) gap between this 
spreading time and the lower limit Q(k). We conjecture that 
9 (k log n) is the exact order of the spreading time, where the 
logarithmic gap arises from the blindness of peer and piece 
selection. A gap of 9 (log n) was shown to be indispensable 
for complete graphs when one-sided random push is used 
(23j. Since the mobility model simply mimics the evolution in 
complete graphs, a logarithmic gap appears to be unavoidable 
when using our algorithm. Nevertheless, we conjecture that 
with a finer tuning of the concentration of measure techniques, 
the current gap O (log 2 n) can be narrowed to 9 (logn). See 
Remark 

III. Proofs and Discussions of Main Results 



The proofs of Theorem [TJ3] are provided in this section. 
Before continuing, we would like to state some preliminar- 
ies regarding the mixing time of a random walk on a 2- 
dimensional grid, some related concentration results, and a 
formal definition of conductance. 



A. Preliminaries 

1) Mixing Time: Define the probability of a typical node 
moving to subsquare Ai at time t as 7Tj(i) starting from 
any subsquare, and denote by iii the steady-state probabil- 
ity of a node residing in subsquare Ai. Define the mixing 
time of our random walk mobility model as T m [ x (e) := 
min{£ : \iri(t) ~ TTi\ < e,Vi}, which characterizes the time 
until the underlying Markov chain is close to its stationary 
distribution. It is well known that the mixing time of a random 
walk on a grid satisfies (e.g. see p3] Corollary 2.3] and 1 33 
Appendix C]): 



T mix (e) < cm Hog - + log 



n 



(6) 



for some constant c. We take e = ?i~ 10 throughout this paper, 
so T m j x (e) < com log n holds with cq = 10c. After CQinlogn 
amount of time slots, all the nodes will reside in any subsquare 
almost uniformly likely. In fact, n~ 10 is very conservative and 
a much larger e suffices for our purpose, but this gives us a 
good sense of the sharpness of the mixing time order. See [34 
Section 6] for detailed characterization of the mixing time of 
random walks on graphs. 
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2) Concentration Results: The following concentration re- 
sult is also useful for our analysis. 

Lemma 1. Assume that b nodes are thrown independently into 
m subsquares. Suppose for any subsquare A it the probability 
qAi of each node being thrown to Ai is bounded as 



1 

m 



< 



1 

3m 



(7) 



Then for any constant e, the number of nodes N Ai (t) falling 
in any subsquare Ai(l < i < m < n) at any time t 6 [l, n 2 ] 
satisfies 

a) if b = (mlogn) and b > 32m log n, then 

y(!,t)^<N A(id (t)<^]>i-^ 

bm dm J n- 3 

b) if b — to (m log n), tlien 

~-e)b 



V(»,t) 



< N m (t) < 



> 1 - 



Proof: See Appendix [A] ■ 
This implies that the number of nodes residing in each 
subsquare at each time of interest will be reasonably close to 
the true mean. This concentration result follows from standard 
Chernoff bounds p5] Appendix A], and forms the basis for 
our analysis. 

3) Conductance: Conductance is an isoperimetric measure 
that characterizes the expansion property of the underlying 
graph. Consider an irreducible reversible transition matrix P 
with its states represented by V (\V\ — n). Assume that the 
stationary distribution is uniform over all states. In spectral 
graph theory, the conductance associated with P is |34| 



$ (P) = inf 

BCV,|B|<| 



p 



B 



(8) 



which characterizes how easy the probability flow can cross 
from one subset of nodes to its complement. If the transition 
matrix P is chosen such that 




if j e neighbor(i), 
else, 



(9) 



where rfj denotes the degree of vertex i, then the conductance 
associated with random geometric graph with radius r(n) 
obeys $>(P) — (r(n)) |33| , where r(n) is the transmission 
radius. 



B. Single-message Dissemination in Mobile Networks 

We only briefly sketch the proof for Theorem[T]in this paper, 
since the approach is somewhat standard (see [15|). Lemma 
[TJ implies that with high probability, the number of nodes 
residing in each subsquare will exhibit sharp concentration 
around the mean n/m once n > 32m log n. For each message 
Mi, denote by Ni(t) the number of users containing M, 
at time t. The spreading process is divided into 2 phases: 
1 < Ni(t) < n/2 and n/2 < iV»(i) < n. 

As an aside, if we denote by pij the probability that I 
successfully transmit to j in the next time slot conditional 



on the event that there are 0(rt/m) users residing in each 
subsquare, then if m < 39 1 " - , one has 

O (™) , if Z and j can move to the same 
pij = < subsquare in the next time slot, 

0. else. 

Concentration results imply that for any given time t and any 
user I, there are (^H users that can lie within the same 
subsquare as I with high probability. On the other hand, for 
a geometric random graph with r(n) = •v/l/m = v(n), 
the transition matrix defined in (|9j satisfies = (™) 
for all j inside the transmission range of I (where there are 
with high probability f^M users inside the transmission 
range). Therefore, if we define the conductance related to 

this mobility model as $ (n) = iniBcV,\B\<§ ~ J ^fgf~ Ph » 
then this is order-wise equivalent to the conductance of 
the geometric random graph with r(n) = v(n), and hence 
$(n) = 0(r(n)) = 0(u(n)). 

1) Phase 1: Look at the beginning of each slot, all senders 
containing Mj may transmit it to any nodes in the 9 subsquares 
equally likely with constant probability by the end of this slot. 
Using the same argument as p3| , one can see that the expected 
increment of iVj(f) by the end of this slot can be lower 
bounded by the number of nodes Ni (t) times the conductance 
related to the mobility model $(n) = (v(n)) defined above. 
We can thus conclude that before N(t) — n/2, 

E (Ni(t + 1) - Ni(t) | Ni(t)) > 6iJVi(i)$ (n) = hNi(t)v(n) 

holds for some constant bi and b\. Following the same 
martingale-based proof technique used for single-message 
dissemination in fT3] Theorem 3.1], we can prove that for 
any e > 0, the time T a (e) by which Ni(t) > n/2 holds with 
probability at least 1 — e can be bounded by 



T n (e)=0 



log n + log e 1 
$(n) 



O 



log n + log e 1 



v(n) 



log" 
v{n) 



(10) 
with 



Take e = n 3 , then T.n(e) is bounded by O 

probability at least 1 — n~ 3 . 

2) Phase 2: This phase starts at Tn and ends when Ni(t) — 
n. Since the roles of j and / are symmetric, the probability of j 
having I as the nearest neighbor is equal to the probability of I 
having j as the nearest neighbor. This further yields p^ = pji 
by observing that the transmission success probability is the 
same for each designated pair. Therefore, we can see: 



E(JV,(t + l)-JVi(t) | Ni(t)) 
Ej 



= b 2 (n-N z (t)) 
> b 2 {n-Ni(t))$(n). 



n -Ni(t) 



(11) 



(12) 



Denote by T,2 the duration of Phase 2. We can follow the 
same machinery in fT7) to see: 



\<S>(n)J \v(n) 



(13) 
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with probability exceeding 1 — n 3 . 

By combining the duration of Phase 1 and Phase 2 and 
applying the union bound over all distinct messages, we can 
see that T^(i) = O (^y) holds for all distinct messages 

with high probability. When v(n) > \J 32l ° en , at any time 
instance each node can only transmit a message to nodes 
at a distance at most O (v(n)), and hence it will take at 
least £1 (l/v(n)) time instances for Mj to be relayed to the 
node farthest from i at time 0, which is a universal lower 
bound on the spreading time. Therefore, T^(i) is only a 
logarithmic factor away from the fundamental lower bound 
Q ( J ■ It can be seen that the bottleneck of this upper 
bound lies in the conductance of the underlying random 

network. When v(n) = u y\J^^j> mobility accelerates 

spreading by increasing the conductance. The mixing time 
duration is much larger than the spreading time, which implies 
that the copies of each message is still spatially constrained in 
a fixed region (typically clustering around the source) without 
being spread out over the entire square. We note that with full 
mobility, the spreading time in single message dissemination 
case achieves the universal lower bound 0(log?i), which is 
much smaller than that with a limited degree of velocity. 



C. Multi-message Spreading in Static Networks with RAN- 
DOM PUSH 

The proof idea of Theorem [2] is sketched in this subsection. 
1 ) The Lower Bound on the Spreading Time: To begin our 
analysis, we partition the entire unit square as follows: 

• The unit square is divided into a set of nonoverlapping 
tiles {Bj} each of side length y32 log n/n as illustrated 
in Fig. [2] (Note that this is a different partition from 
subsquares {Aj} resulting from the mobility model). 

• The above partition also allows us to slice the 
network area into vertical strips each of width 
y32 log n/n and length 1. Label the vertical strips as 

{Vi} (l < I < y/n/ (32 log n)j in increasing order from 
left to right, and denote by Ny t (t) and VVy, (t) the number 
and the set of nodes in Vi that contains M* by time t, 

• The vertical strips are further grouped into vertical 
blocks {Vj} each containing logn strips, i.e. = 
{Vi : (j - 1) log 7i+ 1 < I < jlogn}. 

Remark 4. Since each tile has an area of 32 log n/n, con- 
centration results (Lemma [TJ imply that there are O (log n) 
nodes residing in each tile with high probability. Since each 
sender only attempts to transmit to its nearest receiver, then 
with high probability the communication process occurs only 
to nodes within the same tile or in adjacent tiles. 

Without loss of generality, we assume that the source of M* 
resides in the leftmost vertical strip V\. We aim at counting the 
time taken for M* to cross each vertical block horizontally. 
In order to decouple the counting for different vertical blocks, 
we construct a new spreading process Q* as follows. 



Spreading in Process Q*\ 

1) At t = 0, distribute M* to all nodes residing in 
vertical strip V\. 

2) Each node adopts RANDOM PUSH as the message 
selection strategy. 

3) Define T, b = min jt : N v t(t) > oj as the first 

time that M* reaches vertical block VJ b . For all 
I > 2, distribute M* to all nodes residing in either 
V l h _ 1 or the leftmost strip of at time t = . 



It can be verified using a coupling approach that Q* evolves 
stochastically faster than the true process. By enforcing 
mandatory dissemination at every t — T b , we enable separate 
counting for spreading time in different blocks - the spreading 
in V^ +1 after T^ +1 is independent of what has happened in 

blocks, the 



VJ b . Roughly speaking, since there are 



32 log 3 n 



spreading time over the entire region is 6 ^ y poly ( log „) 

times the spreading time over a typical block 

We perform a single-block analysis in the following lemma, 
and characterize the rate of propagation across different strips 
over a typical block in Q* . By Property 3) of Q*, the time 
taken to cross each typical block is equivalent to the crossing 
time in Vf. Specifically, we demonstrate that the time taken 
for a message to cross a single block is at least fl (u; 1-e ) 
for any positive e. Since the crossing time for each block in 
Q* is statistically equivalent, this single-block analysis further 
allows us to lower bound the crossing time for the entire 
region. 

Lemma 2. Consider the spreading of M* over in the 
original process Q. Suppose each node contains at least 
w = to {poly (log n)) messages initially. Define tx '■= w 1 " 6 , 
and define I* = min {/ : Ny t (tx) = O (w t logn)}. Then with 
probability at least 1 — 3n -3 , we have 

(a) I* < | logn; 

(b) Vs (1 < s < /*), there exists a constant C31 such that 

(1 \ s— * 
(c 3 i^logn); (14) 

(c) N v (t x )<log 2 n. 

2 10E ™ 

Sketch of Proof of Lemma [2} The proof makes use of 
the fixed-point type of argument. The detailed derivation is 
deferred to Appendix [B] 

■ 

The key observation from the above lemma is that the 
number of nodes in V s containing M* is decaying rapidly 
as s increases, which is illustrated in Fig. [2] We also observe 
that N Vl (tx) decreases to O (log 2 n) before V\ logn . 

While Lemma [2] determines the number of copies of M* 
inside V\ ~ Vi log „ by time tx, it does not indicate whether 
M* has crossed the block V b or not by tx- It order to 
characterize the crossing time, we still need to examine the 
evolution in strips Vi logn+1 ~ Viogn- Since communication 
occurs only between adjacent strips or within the same strip, 
all copies lying to the right of Vi log „ must be relayed via a 
path that starts from V\ and passes through Vi i ogn . That said, 
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Vj b (containing logn vertical strips) 

Figure 2. The plot illustrates that the number jVy. (tx) of nodes containing 
Mi in vertical strip Vi by time tx is decaying rapidly with geometric rate. 



all copies in Vi logn+1 ~ V logn by 
forwarded (possibly in a multi-hop 
having received M* by tx- If we 
the set of nodes in V\ ogn /2 having 
Q* , then we can construct a process 
M/ (*x) receive M* from the 

Hog n/2 v A ' 

and hence the evolution in Q can be 
£/* by time tx- 



Spreading in Process Q: 



time tx must have been 
manner) via some nodes 
denote by Afv logn/2 (t x ) 
received M* by tx in 
Q in which all nodes in 
very beginning (t = 0), 
stochastically faster than 



1) Initialize (a): at t = 0, for all v £ My 



,(tx), 



2) 



'Vi og „ /2 

distribute M* to all nodes residing in the same tile 
as v. 

Initialize (b): at t — 0, if «i and v-i are two nodes 
in A/,* sucn that V\ and w 2 are less than 



'v losn/2 

logn tiles away from each other, then distribute 
M* to all nodes in all tiles between v\ and V2 in 
^iogn/2- After this step, tiles that contain M* forms 
a set of nonoverlapping substrips. 

3) By time tx = w 1 ~ e , the evolution to the left of 
^iogn/2 occurs exactly the same as in Q* . 

4) At the first time slot in which any node in the above 
substrips selects M* for transmission, distribute 
M* to all nodes in all tiles adjacent to any of these 
substrips. In other words, we expand all substrips 
outwards by one tile. 

5) Repeat from 4) but consider the new set of substrips 
after expansion. 



By our construction of Q, the evolution to the left of 
Vi logn stays completely the same as that in Q* , and hence 
there is no successful transmission of M* between nodes in 
Vi ~ Vi lo g n _ 1 and those in V\ ogn / 2 but not contained in 
■^Vi Therefore, in our new process Q, the evolution 



to the left of Vi i ogn by time tx is decoupled with that to the 
right of Vi logn by time tx- 

Our objective is to examine how likely T 2 * = 
min{t : M* reaches Vi ogn in C*} is smaller than tx- It can 
be observed that any two substrips would never merge before 
T 2 * since they are initially spaced at least log n tiles from each 
other. This allows us to treat them separately. Specifically, the 
following lemma provides a lower bound on T 2 * by studying 
the process Q. 



Lemma 3. Suppose tx = w 1 e and each node contains at 
least w distinct messages since t = 0. Then we have 

4 

T3- 



'(T*<t x ) < 



rr 



(15) 



Proof: See Appendix [C] ■ 
This lemma indicates that M* is unable to cross V Y by 
time tx = w 1_E in Q. Since Q is stochastically faster than 
the original process, the time taken for M* to cross a vertical 
block in the original process exceeds tx with high probability. 
In other words, the number of nodes having received M* by 
tx vanishes within no more than O (log n) further strips. 
Since there are 9 ^^/n/poly(log n)^ vertical blocks in 

total, and crossing each block takes at least (iu 1-£ ) time 
slots, the time taken for M* to cross all blocks can thus be 
bounded below as 



T* = 0, 



poly (logn) 



(16) 



with high probability. 

2) Discussion: Theorem [2] implies that if a message 
M* is injected into the network when each user contains 
il (fc/poly(logn)) messages, the spreading time for M* is 
f2 ^fc 1-e ^/n/poly(logn)^ for arbitrarily small e. That said, 

there exists a gap as large as 57 (y/n/ poly (log n) ^ from 
optimality. The tightness of this lower bound can be verified 
by deriving an upper bound using the conductance-based 
approach as follows. 

We observe that the message selection probability for M* is 
always lower bounded by 1/k. Hence, we can couple a new 
process adopting a different message-selection strategy such 
that a transmitter containing M* selects it for transmission 
with state-independent probability 1/k at each time. It can 
be verified that this process evolves stochastically slower than 
the original one. The conductance associated with the new 

evolution for M* is = \<d(r{n)) = O 

Applying similar analysis as in fl7) yields 

/ poly (log n 



log u 



T l = 



\ $(n) 



O (/cV«poly(logn)) (17) 



with probability exceeding 1 — n~ 2 , which is only a poly- 
logarithmic gap from the lower bound we derived. 

The tightness of this upper bound implies that the propaga- 
tion bottleneck is captured by the conductance-based measure 
- the copies of each message tend to cluster around the source 
at any time instead of spreading out (see Fig. [4). That said, 
only the nodes lying around the boundary are likely to forward 
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the message to new users. Capacity loss occurs to the users 
inside the cluster since many transmissions occur to receivers 
who have already received the message and are thus wasted. 
This graph expansion bottleneck can be overcome with the 
assistance of mobility. 



D. Multi-message Spreading in Mobile Networks with MO- 
BILE PUSH 

The proof of Theorem [3] is sketched in this subsection. We 
divide the entire evolution process into 3 phases. The duration 
of Phase 1 is chosen to allow each message to be forwarded 
to a sufficiently large number of users. After this initial phase 
(which acts to "seed" the network with a sufficient number of 
all the messages), random gossiping ensures the spread of all 
messages to all nodes. 

1) Phase 1: This phase accounts for the first 
c 6 (c m log n + Ch " gn ) log 2 n = 6 (m log 3 n) time 
slots, where cq, cq and Ch are constants independent of 
m and n. At the end of this phase, each message will be 
contained in at least 32m log n = O(mlogn) nodes. The 
time intended for this phase largely exceeds the mixing time 
of the random walk mobility model, which enables these 
copies to "uniformly" spread out over space. 

We are interested in counting how many nodes will contain 
a particular message Mj by the end of Phase 1. Instead of 
counting all potential multi-hop relaying of M,, we only look 
at the set of nodes that receive Mi directly from source i in 
odd slots. This approach provides a crude lower bound on 
Ni(t) at the end of Phase 1, but it suffices for our purpose. 

Consider the following scenario: at time t±, node i attempts 
to transmit its message Mi to receiver j. Denote by Zi(t) 
(1 < i < n) the subsquare position of node i, and define 
the relative coordinate Zij(t) := Zi(t) — Zj(t). Clearly, 
Zij(t) forms another two-dimensional random walk on a 
discrete torus. For notional convenience, we introduce the 
the notation P (•) = P(- | 2^(0) = (0,0)) to denote the 
conditional measure given Zu (0) = (0,0). The following 
lemma characterizes the hitting time of this random walk to 
the boundary. 

Lemma 4. Consider the symmetric random walk Zij(t) de- 
fined above. Denote the set Ahd of subsquares on the boundary 



A hd = \A 



, v TYh . \ , / \/m , 
■I = ! \, ,3) or Ai = U : V 2 ) v/ 



and define the first hitting time to the boundary as T blt — 
min {t : Zij(t) £ -4bd}, then there is a constant Ch such that 



T m < 



Ch log n 



< 



1 



(18) 



Proof: See Appendix [D] ■ 
Besides, the following lemma provides an upper bound on 
the expected number of time slots by time t during which the 
walk returns to (0,0). 




Lemma 5. For the random walk Zij(t) defined above, there 
exist constants C3 and ci, such that for any t < — ^ - : 

E(^l(^(fc) = (0,0)) Z«(0) = (0,0) 

Vfc=i 



Here, 1 (•) denotes the indicator function. 

Sketch of Proof of Lemma [5} Denote by "Hbd the event 
that Zij (t) hits the boundary _4bd (as defined in Lemma |4j) 
before t — m/ (chlogn). Conditional on Zy(0) = (0,0), the 
probability q®j(t) of Zij(t) returning to (0,0) at time t can 
then be bounded as 



q°j(t) ^ P o (^w) + P o ( Z «(*) = (°» °) A 



(20) 



Now, observe that when restricted to the set of sample paths 
where Zu (t) does not reach the boundary by t, we can couple 
the sample paths of Zij(t) to the sample paths of a random 
walk Zij(t) over an infinite plane before the corresponding 
hitting time to the boundary. Denote by 7ibd the event that 
Zij(t) hits ^4bd by t = m/ (chlogn) , then 



»o (Zij(jt) = (0, 0) a n bd ) = p (z i:j (t) = (0, 0) a n bd 

<P = (0,0) 



The return probability obeys P Z l3 {t) = (0, 0) ~ t' 1 for 



a random walk over an infinite plane [36], and Po (Hm) will 
be bounded in Lemma |4j Summing up all q®j(t) yields ( jl9| . 
See Appendix |E] for detailed derivation. ■ 
In order to derive an estimate on the number of distinct 
nodes receiving Mi directly from source i, we need to cal- 
culate the number of slots where i fails to forward Mi to 
a new user. In addition to physical-layer outage events, some 
transmissions occur to users already possessing Mi, and hence 
are not successful. Recall that we are using one-sided push- 
only strategy, and hence we cannot always send an innovative 
message. Denote by Fi (t) the number of wasted transmissions 
from i to some users already containing M, by time t. This 
can be estimated as in the following lemma. 

Lemma 6. For tn = — — . the number of wasted transmis- 

u Ch log n J 

sions Fi it) defined above obeys 



™ , ~ / » % to log n 
®{Fi(t ))<c 5 2-t 



(21) 



for some fixed constant C5 with probability exceeding 1 — 3n 3 . 

Sketch of Proof of Lemma [6} Consider a particular pair 
of nodes i and j, where i is the source and j contains Mj. 
A wasted transmission occurs when (a) i and j meets in the 
same subsquare again, and (b) i is designated as a sender with 
j being the intended receiver. The probability of event (a) can 
be calculated using Lemma [5] Besides, the probability of (b) 
is O (m/n) due to sharp concentration on Na,- See Appendix 

El ■ 

The above result is helpful in estimating the expected 
number of distinct users containing Mj. However, it is not 
obvious whether Fi(t) exhibits desired sharp concentration. 
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The difficulty is partly due to the dependence among {Zi } (t)} 
for different t arising from its Markov property. Due to their 
underlying relation with location of i, Z^ (t) and Z^ 2 (t) are 
not independent either for ji ^ However, this difficulty 
can be circumvented by constructing different processes that 
exhibit approximate mutual independence as follows. 

The time duration [l, cq (com log n + to/ (q, log n)) log 2 n] 
of Phase 1 are divided into c§ log 2 n non-overlapping 
subphases Pi j (l < j < log 2 n) for some constant cq. Each 
odd subphase accounts for m/ (chlogn) time slots, whereas 
each even subphase contains comlogn slots. See Fig. [3] for 
an illustration. Instead of studying the true evolution, we 
consider different evolutions for each subphase. In each odd 
subphase, source i attempts to transmit message Mi to its 
intended receiver as in the original process. But in every even 
subphase, all new transmissions will be immediately deleted. 
The purpose for constructing these clearance or relaxation 
processes in even subphases is to allow for approximately 
independent counting for odd subphases. The duration 
Com log n of each even subphase, which is larger than the 
typical mixing time duration of the random walk, is sufficient 
to allow each user to move to everywhere almost uniformly 
likely. 



Spreading Relaxation Spreading Relaxation 
I Subphase i Subphase i Subphase i Subphase i 



m/(c h logtt) c m\ogn 
slots slots 



2c 6 \og 2 n subphases 

Figure 3. Phase 1 is divided into 2c6 log 2 n subphases. Each odd subphase 
accounts for m/ (chlogn) slots, during which all nodes perform message 
spreading. Each even subphase contains com log n slots, during which no 
transmissions occur; it allows all nodes containing a typical message to be 
uniformly spread out. 



Lemma 7. Set t to be c^ ^comlogn + - t " g - ^ log 2 n, which 
is the end time slot of Phase 1. The number of users containing 
each message Mi can be bounded below as 



Vi, Ni (t) > 32m log n 



(22) 



with probability at least 1 — c-jn 



Proof: See Appendix |G| ■ 
In fact, if to log 2 n <C n holds, the above lemma can be 
further refined to Ni (t) = O (m log 2 n) . This implies that, 
by the end of Phase 1, each message has been flooded to 
(to log n) users. They are able to cover all subsquares (i.e., 
the messages' locations are roughly uniformly distributed over 
the unit square) after a further mixing time duration. 

2) Phase 2: This phase starts from the end of Phase 1 and 
ends when iVj(t) > n/8 for all i. We use t = to denote 
the starting slot of Phase 2 for convenience of presentation. 
Instead of directly looking at the original process, we generate 
a new process Q which evolves slower than the original process 



Spreading in Static Networks 
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Spreading in Mobile Networks 
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Figure 4. The left plot illustrates the clustering phenomena of Ni(t) in the 
evolution in a static network. However, even restricted mobility may allow 
these nodes to spread out within the mixing time duration as illustrated in the 
right plot. 



Q. Define and 5,(4) as the set of messages that node i 

contains at time t in Q and Q, with Sj(f) and §i(t) denoting 
their cardinality, respectively. For more clear exposition, we 
divide the entire phase into several time blocks each of length 
k + Co \ogn/v 2 (n), and use ts to label different time blocks. 
We define A^ B (ts) to denote Ni(t) with t being the starting 
time of time block ts- Q is generated from Q: everything 
in these two processes remains the same (including locations, 
movements, physical-layer outage events, etc.) except message 
selection strategies, detailed below: 



Message Selection Strategy in the Coupled Process Q: 

1) Initialize: At t = 0, for all i, copy the set 5j(t) of 
all messages that i contains to Si(t). Set ts = 0. 

2) In the next co\ogn/v 2 (n) time slots, all new 
messages received in this subphase are immediately 
deleted, i.e., no successful forwarding occurs in this 
subphase regardless of the locations and physical- 
layer conditions. 

3) In the next k slots, for every sender i, each message 
it contains is randomly selected with probability 
1/k for transmission. 

4) For all i, if the number of nodes containing Mj 
is larger than 2Nf(ts), delete M, from some of 
these nodes so that Ni(t) = 2Nf(t B ) by the end 
of this time block. 

5) Set t B <- t B + 1. Repeat from (2) until Ni > n/8 
for all i. 



Thus, each time block consists of a relaxation period and a 
spreading period. The key idea is to simulate an approximately 
spatially-uniform evolution, which is summarized as follows: 

• After each spreading subphase, we give the process a 
relaxation period to allow each node to move almost 
uniformly likely to all subsquares. This is similar to the 
relaxation period introduced in Phase 1. 

• Trimming the messages alone does not necessarily gen- 
erate a slower process, because it potentially increases 
the selection probability for each message. Therefore, we 
force the message selection probability to be a lower 
bound 1/k, which is state-independent. Surprisingly, this 
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conservative bound suffices for our purpose because it is 
exactly one of the bottlenecks for the evolution. 
The following lemma makes a formal comparison of Q and Q. 

Lemma 8. Q evolves stochastically slower than Q, i.e. 

(23) 



(T 2 > x) < P T 2 > x , Vx > 



where T 2 — mm{i : Ni(t) > n/8, Vi} ant/ T 2 = 
mm are f/ie stopping time of Phase 2 

for Q and Q, respectively. 

Proof: Whenever a node i sends a message Mfc to j in Q: 
(a) if Mfe £ Si, then i selects with probability and a 

random useless message otherwise; (b) if ^ 5i, i always 
sends a random noise message. The initial condition Si = 
Si guarantees that 5, C 5j always holds with this coupling 
method. Hence, the claimed stochastic order holds. ■ 

Lemma 9. Denote by ■= min it B ■ N?(t B ) > n 

the stopping time block of Phase 2 in Q. Then there exists a 
constant C14 independent of n such that 



/8,V*} 



T? <41og Ci4 n) <l-n-\ 

Sketch of Proof of Lemma |H We first look at a 
particular message Mi, and use union bound later after we 
derive the concentration results on the stopping time associated 
with this message. We observe the following facts: after a 
mixing time duration, the number of users Ni t A k (i) containing 
Mi at each subsquare Ak is approximately uniform. Since 
Nf(t B ) is the lower bound on the number of copies of 
Mi across this time block, concentration results suggest that 
N i)Ak (t) = SI (NP(t B )/m). Observing from the mobility 
model that the position of any node inside a subsquare is Ltd. 
chosen, we can derive 



E(jVf(t B + l)-JVf(i B ) \K B (t B 



>~N?(t B ) (24) 



for some constant eg. A standard martingale argument then 
yields an upper bound on the stopping time. See Appendix |H| 
for detailed derivation. ■ 

This lemma implies that after at most 4 log Cl4 n time blocks, 
the number of nodes containing all messages will exceed n/8 
with high probability. Therefore, the duration T 2 of Phase 2 of 
Q satisfies T 2 — O (k log n) with high probability. This gives 
us an upper bound on T 2 of the original evolution Q. 

3) Phase 3: This phase ends when iV,*(t) = n for all i with 
t = denoting the end of Phase 2. Assume that N^a- (0) > 
for all i and all j, otherwise we can let the process further 
evolve for another mixing time duration (logn/w 2 (n)). 

Lemma 10. Denote by T 3 the duration of Phase 3, i.e. 
T 3 = min{t :Ni(t)=n \ N,(0) > n/8,Vi}. Then there ex- 
ists a constant C\% such that 



64 

T 3 < — fclogn 

cm 



> 1 



15 
16n 2 ' 



(25) 



Sketch of Proof of Lemma [10} The random push 
strategies are efficient near the start (exponential growth), but 
the evolution will begin to slow down after Phase 2. The 



concentration effect allows us to obtain a different evolution 
bound as 

E{Ni(t + l)-Ni(t)\Ni(t)) 
= E(n-JVi(t)-(n-iVi(t + l))|JVi(t)) 

> §| («-"<(*))• 

Constructing a different submartingale based on n — -/Vj(t) 
yields the above results. See Appendix [I] ■ 
4) Discussion: Combining the stopping time in all three 
phases, we can see that: the spreading time T^ p = 
min{£ : Mi, Ni (t) = n} satisfies 



mp " \ v 2 (n) 



-O (k log n)+0 (k log n) = O (k log 2 n) 



It can be observed that, the mixing time bottleneck will not 
be critical in multi-message dissemination. The reason is that 

the mixing time in the regime v(n) = cj ( -y^f^J is much 

smaller than the optimal spreading time. Hence, the nodes have 
sufficient time to spread out to everywhere. The key step is 
how to seed the network with a sufficiently large number of 
copies at the initial stage of the spreading process, which is 
accomplished by the self-promotion phase of MOBILE PUSH. 

Remark 5. It can be observed that the upper bounds on 
spreading time within Phase 2 and Phase 3 are order-wise 
tight, since a gap of f2(logn) exists even for complete graphs 
p6| . The upper bound for Phase 1, however, might not be 
necessarily tight. We note that the O (log 2 n) factor arises 
in the analysis stated in Lemma [7J where we assume that 
each relaxation subphase is of duration O (m log n) for ease 
of analysis. Since we consider 9 (log 2 n) subphases in to- 
tal, we do not necessarily need 9 (m log n) slots for each 
relaxation subphase in order to allow spreading of all copies. 
We conjecture that with a finer tuning of the concentration 
of measures and coupling techniques, it may be possible to 
obtain a spreading time of 9(fclogn). 

IV. Concluding Remarks 

In this paper, we design a simple distributed gossip-style 
protocol that achieves near-optimal spreading rate for multi- 
message dissemination, with the assistance of mobility. The 
key observation is that random gossiping over static geometric 
graphs is inherently constrained by the expansion property 
of the underlying graph - capacity loss occurs since the 
copies are spatially constrained instead of being spread out. 
Encouragingly, this bottleneck can indeed be overcome in 
mobile networks, even with fairly limited degree of velocity. 
In fact, the velocity-constrained mobility assists in achieving a 
large expansion property from a long-term perspective, which 
simulates a spatially-uniform evolution. 
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Appendix 

A. Proof of Lemma [7] 

Let us look at a typical time slot t at subsquare A.- L . We know 
that 26/3m < E (N A(l) (t)) < 46/3m. For each node j, define 
the indicator variable Xj ^.(t) := 1 {j lies in Ai at i}. Then, 
{Xj t A t {t) : 1 < j < b} forms a set of i.i.d. random variables 
each satisfying ^ < F(X jAi (t) = 1) < ^. 

Define another two sets of i.i.d. Bernoulli random variables 
|x j ub 4i (i) : 1 < j < bj and |x£ Aj (t) : 1 < j < 6 j such that 



4 
3m ' 



3m * 



We also define A* (<) := £ -^.(t) and JV* (t) := 

£ -X^. (*)• The following stochastic orders can be imme- 

i<i<fc 

diately observed through simple coupling arguments 

< (1) (^%)(^< W W, (26) 
i.e. for any positive d, we have 

P (j\# (i) (t) > d) < P (JVa«)(*) > d) < P > d) . 

Applying Chernoff bound yields 

(1 + e)4b 




3m 

(l + e)46 



3m 



< exp 



We can thus observe through union bound that 

P U(i,t) such that N A{l) (t) > (1 + ^ Ah 
<m^p(^ (l)W > (1 + e)46 



3m 



< n exp 



2be 2 
3m 



Similarly, the stochastic order implies that 



3m 



,J L) 



Therefore, we have 



3(i,t) such that N A(i) (t) < 



(1 - e)26 
3m 



< n exp 



3m 



When b = lo (m log n), exp ^— y ^ <C n c holds for any 
positive constant c; when b = (m log n) and 6 > 32m log n, 
taking e = § completes our proof. 



B. Proof of Lemma [2] 

Consider first the vertical strip V; . Obviously, the spreading 
within Vi will be influenced by nodes residing in adjacent 
strips Vj-i.Vj-fi, and V/ itself. Define a set of i.i.d. Bernoulli 
random variables {A„ jt } such that 



F(X v<t = l) 



1 



(27) 



For any v £ U Vl _ 1 {t x )[jMv l { t x)[}Mv l+1 {tx), the proba- 
bility of node v selecting M* for transmission at time t can 
be bounded above by P (X Vyt = 1). Simple coupling argument 
yields the following stochastic order 

N Vl (tx) 

tx 

< 1 (v selects M* for transmission at t) 

t—l Z + l 

«e U MvAtx) 

i=l — l 

t x N Vl _ 1 (t x )+N Vl (t x )+N V[ + 1 (t x ) 



4=1 



v=l 



where ^ st denotes stochastic order. If Nv l _ 1 {tx) 

Nv l+1 {tx) > u> (w e logn), then we have 



N Vl (tx) > — (Nv^itx) + N Vl+1 (t x )) 



<V\[l---)N* Vi (tx)> 



3 1 
2w 
3 1 

2 v? 



{Nvt.Atx) + N Vl+1 (t x )) 



(29) 



3 1 



i+i 



= P^ Vl (tx)>^ I £ tf Vj (tx) 

tx w v i _ 1 (tx)+Afv- ! (tx)+A r v i + 1 (*x) 

t=l u=l 

tx ^v i _ 1 (tx)+JV Vi (tx) + A r v i + 1 (tx) 



AT,, 



(30) 



< 



where ( |30[ > follows from the stochastic order ( |28| i, and the 
last inequality follows from large deviation bounds and the 
observation that 

/tx N V l _ 1 {tx)+Nv l {t X )+Nv l+1 {tx) 

E MT £ X„, t | =w(Iogn). 



t=i 



Besides, if N Vl _ x {t X ) + N Vl+1 (t x ) = 0(w e logn), then 
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we have with high probability. Proceeding with similar spirit gives us: 

„ , for all s (0 < s < L* < log n/2) 
P{N Vl (t x )>3log 2 n} 

< P \N Vl (t x ) > — (Ny^ (t x ) + N Vl+1 {fx)) + 2 log 2 n 1 " ' 



2 , s o 1 N VL ,_ s (t x ) + N VL , +s (t x ) 

w 



4 loff TL 

< _iL_ (Nv^^ + Nv^^itx)) (34) 



<P \ n Vi (tx) > \ ^2 Ny i ) + log2 n \ holds with probability at least l-n~ 4 . By iteratively applying 

\j=i-i ) ) (34 1 we can derive that for any s (1 < s < L*), 



w 

t x N Vl _ 1 (tx)+N Vl (t x )+N Vl + 1 (t x ) 



< 



(31) /41ogn 



| < N Va (t x ) + Nv 2L ,_ B (tx) 

s-1 



< (^) " (Nv^ + Ny^tx)) 
where Y Vit = X v>t - 1/w. Hence, Y Vtt = s _j 
1 - 1/w with probability 1/w. Let - 1 := < [ g n ) (csi^logn) 

W e log 2 n/(7V yi (^)+iV V2 (i x ) + Ar y3 ( ix )) = fi(logn). V ™ e / 

By applying (35J Theorem A. 1.1 2], we can show that there where the last inequality arises from the fact that there are 

exist constants c 2 i and c 2 i such that at most (^l og n) nodes residing in each strip with high 

i og 2„ ^ probability. 

(31| < ( ] < (-£ 2ie )- lo g 2 " lo g lo g" < — (32) This shows the geometric decaying rate of N Vs (t x ) in s. 

I V°g n / 71 Suppose that I* > log n/2, then we have 

Define I* := min{Z : N Vl (t x ) = O (w e logn)} and L* := . iiogn-i 

mm{Z*,0.51ogn}. Since N v ^ +1 (t x ) + Nv^fa) > n v ( tx ) = f^^Y (c 31 Vnlogn) 

Nv L , _ 1 (t x ) — lo (w € logn), we can derive with similar spirit 2 ° g " \ w / 

that <1 = o(w~ e logn), 

o 

Nv L *(tx) < — -( N v L , x(tx) + Ny^^tx)) where contradiction arises. Hence, I* < log n/2 with high 

w t probability, 
with probability at least 1 - n~ 5 . Similarly, if N v ^(t x ) + Additionally, we can derive an upper bound on N Viogn/2 

Nv L * +2 (tx) = u (w e log n), we can derive using the same fixed-point arguments as follows 

Ny L , +1 (tx) <^(N VL »(tx) + N VL , +2 (t x )) N V (t x ) 

w < max | log n, 

with high probability. But if N VL ,(t x ) + N VL , +2 (t x ) = /41ogn V logn ~ 1 1 

O (w £ logn), it can still be shown that ( ^ j (N Vl (tx) + ^v tog „_i(*x)) } 

Nv L , +1 (tx)<Wn< l ^N VL ,_ 2 {t x ) (33) = log 2 n 

by observing that N Vl ,_ 2 (t x ) = u (w* log n). Combining all with Probability at least 1 - n~ 3 . 
these facts yields 

, s . , C. Proof of Lemma \3\ 

2 Define a set of Bernoulli random variables {X t } such that 

< — (Nv L ,^(tx) + N VL ,(t x )) + 

Xt = 1 if there is at least one node inside these substrips 

logn selecting M* for transmission at time t and X t — otherwise. 

w e (-^ 1/ i.*+2 (*x) + Ny L , (tx) + Nv L , _ 2 {tx)) By observing that the size of each "substrip" will not exceed 

3 log n / @ (log 3 n) tiles before T 2 * and that each tile contains O (log n) 

^ we (^»-2(*x) + iV Vi . (t x ) + N VL , +2 (t x )) nodeS; ^ probability P (X t = 1) can be bounded above by 

<^ (AVt ( tx ) + N v , (t x ))+ l-(l-i) e(l0s3n) <^^ for some constant^. This 

w e L +2 inspires us to construct the following set of i.i.d. Bernoulli 

6 logn , , , random variables through coupling as follows: 

-^J2T- {Nv L . + 1 (t X ) +N VL ,_ 1 (t X )) . I *J B f 6 

Simple manipulation gives us [ if X t = 1, then X t = 1; 

JVvb._ 1 (*x)+JVvi ; , +1 (tx) lif^ =0, then = <| 1; w ' p - i=P(^=rj 

0, otherwise. 

(35) 



<^(^._ 2 (tx) + ^ +2 (*x)) 
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That said, 

— \ 1 , with probability C4 ° 1 ° g3 " ; 
1 0, otherwise. 

Our way for constructing {X t } implies that 

t; t; 



(36) 



(37) 



t=i t=i 
Additionally, large deviation bounds yields 

/ tx 



> 



log n 



<P [J2(X t -E(X})> 



\ogn log n 



\t=i 

' tx 



w' 



<P £(X t -E(X t ))> 



Define 

gives 



w e log n 



3c 40 log 3 n 3c 40 log 2 n 




(38) 



, (35 Theorem A.1.12] 



(381 < 



3 1 

< 



which further results in 



'(t; >t x )>i- 



(39) 



(40) 



To conclude, the message M* is unable to cross V-y by time 
tx = w 1_e with high probability. 

D. Proof of Lemma |£] 

Define two sets of random variables {Xij(t)} and {}%(£)} 
to represent the coordinates of Zij(t) in two dimensions, 
respectively, i.e. Zij(t) = (Xij(t), Y«(i)). Therefore, for any 
A E Abd, we can observe 

F (Z ij (t)=A) 
< P f X 4j (t) = |J ^(t) = 

<4P *«(*)> V 



Besides, we notice that \Xij(t + 1) — Xij(t)\ < 2 and 
E (Xij(t + 1) — Xij(t)) — 0, then large deviation results 
implies 



(41) 



for some constant q,. We thus derive Pq (Xy(t) > 



n 6 for any t < 



b I 2hit < 



< 



3t < 



c h log n 

m 
Ch log rt 



Ch log n 



, which leads to 



< 



and 3A E Ad s.t. = A 



< — 4-y/m ■ 4n~ b < — 

Ch log n n 4 



£. Proof of Lemma [5] 

It can be observed that Zij(t) is a discrete-time random 
walk which at each step randomly moves to one of 25 sites 
each with some constant probability. Formally, we can express 
it as follows: for \a\ < 2, |6| < 2: 

P (Zy (t + 1) - Zy(t) = (fl, 6) I Zy(t)) = P | a |,| 6 | (42) 

holds before Zij(t) hits the boundary, where P| a |,|b| are 
fixed constants independent of n, and PlaU&l = !■ 

|a|<2,|6|<2 

We construct a new process Zij(t) such that Zij(t) 
is a random walk over an "infinite" plane with 

P (Zy(t + 1) - Zij(t) = (a, 6) | Z tf (t)) =P|a|,|6|. 

Define the event Hbd = {Tut <m/ (chlogn)} where Tm 
is the hitting time to the boundary as defined in Lemma |4] 
When restricted to the sample paths where Z^ (t) does not hit 
the boundary by t, we can couple the sample paths of Zij (t) 
to those of Zij (t) before the corresponding hitting time to the 
boundary. 

It is well known that for a random walk Zij(t) over an 
infinite 2-dimensional plane, the return probability obeys g?- ~ 
i -1 036). Specifically, there exists a constant c 3 such that 



(43) 



^)=PoM = (0,0) <~c 3 /t 



Hence, the return probability q^(t) = P {Z l] {t) = (0,0)) of 
the original walk Zij(t) satisfies 



$(*) 

= P„(Z«(f) = (0,0)|H bd )P(H bd ) 
+ P (Zy(t) = (0,0)AS M ) 

< Po (Hbd) + Po (z l3 (t) = (0, 0) A n bd 

< Po (H bd )+P (Z tf (t) - (0,0) 
2c 3 



(44) 
(45) 



< 



£3 
f 



< 



where (45 i arises from the coupling of Zij(t) and Zij(t), and 
the upper bound on P (%bd) is derived in Lemma[4] Hence, the 
expected number of time slots in which i and j move to the 
same subsquare by time t(l <t< Cfc ™ g „ ^ can be bounded 
above as 



< 



E K]l^(0 = (0,0)) 

* 1 

2c 3 ^y < C 3 l0gt 



2y(0) = (0,0) 



!=1 

for some constant C3. 
Proof of Lemma [6] 

Define £j (j ) as the first time slot that j receives Mi. A 
"conflict" event related to Mj is said to occur if at any time 
slot the source i moves to a subsquare that coincides with 
any user j already possessing Mi, Denote by Cj(i) the total 
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amount of conflict events related to Mj that happen before 
tit < — P 1 — ), which can be characterized as follows 



i(z ij (k) = o\z tj (t i (j)) = o) 



c h log n / 

E(d(t)) 



E 



t 

E 

deMW fc=ti(j)+i 



E 



< ffll£;i(Zy(A) = 0|Z tf (0) = Q) 

\fe=i > 

< c 3 t log t, 



where (47 1 arises from the facts that Ni(t) < t and Zj. 



is stationary, and (48 1 follows from Lemma [5] Lemma [T] 
implies that there will be more than n/6m users residing in 
each subsquare with high probability. If this occurs for every 
subsquare in each of t slots, whenever i and j happens to stay 
in the same subsquare, the probability that i can successfully 
transmit Mj to j can be bounded above as 6 (1 — 8) ■ Qm/n. 
Therefore, the amount of successful "retransmissions" can be 
bounded as 



E(fi(t))<0(l- 



^(Qtt^c^ (49) 
n n 



for some fixed constant C5. Setting t = to yields 

™ / „ / n n m log n . „ 

E(Fi(t )) < c 5 5-t = o(*o). 



(50) 



This inequality is based on the assumption that Th; t > to 
and that concentration effect occurs, which happens with 
probability at least 1 — 3n -3 . 

G. Proof of Lemma [7] 

Recall that Pi j denotes the j th subphase of Phase 1. There- 
fore, in the 1 st odd subphase, the total number of successful 
transmissions from i (including "retransmissions"), denoted by 
Gj, can be bounded below as 



Gi > C'jraj logrt 



(51) 



with probability exceeding 1 — c-j /n 4 for some constants and 
C7 and £7. This follows from simple concentration inequality 
and the fact that E (Gi) >c(l-8) 6t. It can be noted that Gi 
exhibits sharp concentration based on simple large deviation 
argument. Also, Markov bound yields 



Fi(t ) >65c 5 ^t 



< 



E(^(to)) 



1 



65c^mto log n/n 65 

(52) 

Since the duration of any odd subphase to = mj (chlogn) 
obeys mto log n/n <C to, this implies that the number of dis- 
tinct new users that can receive Mj within this odd subphase, 
denoted by Ni (Pi 1), can be stochastically bounded as 



N t (P M ) < c 8 



logn 



< 



G4 



(53) 



for some constant c%. Note that we use the bound ^ here 
instead of ^ in order to account for the deviation of Gi. 



We observe that there are in total cem\ogn/ch slots for 
spreading in Phase 1, therefore the total number of dis- 
tinct nodes receiving Mj in Phase 1 is bounded above by 
Cem\ogn/ch <C n/2. Mixing behavior combined with con- 
centration effect thus indicates that during any odd subphase, 
there are at least njYlm nodes not containing Mj in each 
subsquare, and each transmitter is able to contact a new 
receiver (who does not have Mj) in each slot with probability 
at least 1/12. This motivates us to construct the following 
stochastically slower process for odd subphase j: (a) there are 
n/2 nodes residing in the entire square; (b) each transmission 
event is declared "failure" regardless any other state with 
probability 11/12; (c) the evolution in different odd subphases 
are independent; (d) other models and strategies remain the 
same as the original process. 

Obviously, this constructed process allows us to derive a 
lower bound on Ni (t) by the end of Phase 1 . Proceeding with 
similar spirit in the analysis for 1 st odd subphase, we can see 
that the number of distinct new users receiving Mj in odd 
subphase j of our constructed process, denoted by iVj(Pij), 
can be lower bounded as 



loe 



< 



1 

G4 



(54) 



for some constant eg. By noticing that ^Ni(P±j) | are mutu- 
ally independent, simple large deviation inequality yields 



,c 6 



3— log n odd subphases s.t. N t (Plj) > c 8 



log 71 



>1 



Taking cq such that cgCg/2 > 32, we can see that 

Ni (P 1<:j ) > 32m log n 



odd subphase j 



(55) 



holds with probability at least 1 — o(n -3 ). Since N (t) 
in the original process is stochastically larger than the 
total number of distinct users receiving Mj in any sub- 
phase £odd subphase j N i ( p ij)> we can immediately observe: 
N (t) > 32m log n by the end of Phase 1 with high prob- 
ability. Furthermore, applying union bound over all distinct 
messages completes the proof of this lemma. 

H. Proof of Lemma [9] 

We consider the evolution for a typical time block [t b , t s + 
1]. During any time slot in this time block, each node I ^ 
A/^ s (ts) lying in subsquare Ak will receive Mj from a node 
in A/j (ts) with probability at least 

/ receives Mi at t from a node in ATj (t b 

> e8 AW*) l > ^NfjtB) 
~ 2n/m k ~ nk 

for constants eg and eg, because concentration results imply 
that: (1) there will be at least A J B (ts)/3m nodes belonging to 
MPits) residing in each subsquare Ay, (2) there are at most 
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2n/m nodes in each subsquare; (3) each successful transmis- 
sion allows I to receive a specific message Mi with state- 
independent probability 1/k. Therefore, each node j ^ J\fi(t B ) 
will receive Mi from a node in J\f B (t B ) by t B + 1 with 

probability exceeding 1 - (l - ^f^ ) > cgNf(t B )/n 

for some constant eg. Since there are in total n — N B (t B ) 
nodes not containing M, at the beginning of this time block, 
we have 



I receives Mi from a node in A/^ 3 ' (t B ) by is + 1 



> 



The fact that T B 2 — T B 2 A n gives 



n-Nf(t B ) )c 9 Nf(t B )/n 



> f JVf (t B ). 

Moreover, after the relaxation period of ci 3 log n/v 2 (n) 
time slots, all these nodes containing Mj will be spread out 
to all subsquares, then similar arguments will hold for a new 
time block. The mixing time period plays an important role 
in maintaining an approximately uniform distribution of the 
locations of each node. Thus, we can derive the following 
evolution equation: 



1 



1 



N B (t B ) 



2 I — — — — - N?(t B ) j (56) 

(57) 

(58) 

n;- (59) 

Ni(t B ) 

for some constant c 13 < 1, where the inequality (58 1 follows 
from the fact N?(t B + l) < 2N B (t B ) guaranteed by Step (4). 

(.2 

(l-cia)" 
E (Zi {it! 



< 



< 



E 



E 



+ l)-N?(t B ) 



Nf(t B + 



l)N?{t B ) 
L)-N?(t B ) 



N B (t B ) 



Nf(t B ) 



N?(t B ) 



1 



Let Tf 2 = min 



|<s : N B (t B ) > n/s\, and define Zi(t B ) = 



/N B (t B ). Simple manipulation yields 



I ) A t B ) I Zi [t B A f B 2 j )<Z t (t B Af B 

which indicates that |Z; (t B A T B 2 ^j ,t B > o| forms a non- 
negative supermartingale. \Ve further define a stopping time 
f B 2 = f B 2 A n, which satisfies f B 2 <n<oo. The Stopping 
Time Theorem yields (371 Theorem 5.7.6] 



ElZilTi 



-i.2 



= A(Z, IT B 2 AT B 2 



<E(Zi(0)) = — . 



E 



((l-cis)- 



max N B 



< 



(*a) 



Wf(0) 



< n. 



(60) 



Set C14 = (1 — c 13 ) 1 , then we have 
p{Tf 2 >41og Cl4 n} = 



'if. 
hi 



2 4 
d? > n 



E 



< 



14 



< 



' [f B 2 < 41og Ci4 n} = P {ff 2 < 41og Ci4 n} > 1 - 



Finally, applying union bound yields 



'{3i : 



^ 2 > 4 log, 



I. Proof of Lemma 

Define T i 3 = min{t : iVj(t) = n} as the stopping time for 
all i. In each time slot, any node I ^ Ni(t) will receive Mi with 
probability at least cis/lQn for a constant cig simply because 
each transmitter in Afi(t) will select Mi for transmission to 
I with probability at least 1/k. This yields the following 
inequality: 

E(Ni(t + l)-Ni(t)\Ni(t)) > S(n -#<(*))• (61) 

Define ^, 3 (t) = (n - iVf^)) (l - fe|)~*, then b y manip- 
ulation we get E(Z ii3 (i + l) | Z ij3 (t)) > Z ij3 (t), which 
indicates that {Zi^(t)} forms a submartingale. Take <4 = 
64fclogri/cis, we have 



E[n-Ni(t 4 )] < 1 



Cis \ * 4 15n 



.'. P(iVi(t 4 ) <n-l) <E[(n-iVi(f 4 ))] < 
Moreover, applying union bound yields 
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' (3i : N t (t 4 ) < n - 1) < nV (N, (i 4 ) < n - 1) < 



15 
16n 2 ' 



Let T3 = max.i ^ 3. Combining the above results, we can see 
that the duration T 3 of Phase 3 satisfies T 3 = O (fclogn) with 
probability at least 1 



15 
Urn 2 
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